Abstract

In this work we analyze the problem of linear correction of the reliability metrics (L-values) in BICM
receivers. We want to find the correction factors that minimize the probability of error of a maximum
likelihood decoder that uses the corrected L-values. To this end, we use an efficient approximation
of the pairwise error probability in the domain of the cumulant generating functions (CGF) of the
L-values and conclude that the optimal correction factors are equal to twice the saddlepoint of the
CGF. We provide a simple numerical example of transmission in the presence of interference where
we demonstrate a notable improvement attainable with the proposed method. The proposed method is
compared with the one based on the maximization of generalized mutual information.

Index Terms 

Logarithmic Likelihood Ratio, LLR, L-value, Mismatched L-values, Mismatched Decoding,
Generalized Mutual Information, Maximum Likelihood Decoding, ML, Pairwise Error Probability, PEP.



I. Introduction 

The logarithmic likelihood ratios (LLRs, or L-values) calculated at the receiver for the
transmitted bits are a convenient representation of the likelihood of the observations and are often
used in all of the processing operations in the receiver (such as "soft" detection, decoding,
iterative processing, etc.). In this work we consider the so-called mismatched L-values, which
only approximate the true L-values. To "correct" the mismatch, which may occur for many
independent reasons, we analyze the linear scaling of the mismatched L-values. Formulating the
problem in the context of BICM receivers, we aim at the minimization of the probability of
error of the maximum likelihood (ML) decoder that uses the corrected L-values.

to IEEE International Conference on Communications 2012.

The L-value $l_n$ of the bit $c_n$ (transmitted at time $n$) is a well-known way of representing the
reliability of the transmitted bit. It is related to the observation $y_n$ via

$$ l_n = \log \frac{p_{Y_n|C_n}(y_n|1)}{p_{Y_n|C_n}(y_n|0)}, \tag{1} $$

where $p_{Y_n|C_n}(y_n|b)$ is the probability density function (pdf) of the observation conditioned
on the sent bit $C_n = b$.

The L-values are the basic signals/messages exchanged between the processing units. The
multiplications of probabilities required in many processing steps transform into additions of the
corresponding L-values; the numerical simplicity of the resulting operations is the reason behind
the popularity of the L-values. For example, in BICM receivers, the L-values are calculated
by the front-end detector and then passed to the decoder [1]. In some cases, operations on the
L-values are carried out before decoding, as happens when combining the signals obtained in
independent transmissions of the same bit [2]. The L-values are also used in binary decoders
that operate in an iterative fashion, e.g., turbo decoders [ ] or message passing algorithms used
for decoding of LDPC codes [ ].

In some situations, however, the L-values are not appropriately calculated, or are mismatched. 
Ignoring the mismatch when processing the L-values is, in general, suboptimal and to correct 
it, nonlinear operations on the L-values may be required. To make the correction simple, a 
linear operation (i.e., multiplication by a correction factor) is often considered. This idea was 
already studied in the context of BICM receivers [ ], turbo-decoding [6] [7], or LDPC decoding 
[4]. However, the correction factor was most often found through a brute-force search, that is, 
among the results obtained for different correction factors the one ensuring the best performance 
is deemed optimal. While this is a pragmatic approach when searching for one or two correction 
factors, it cannot be applied when many correction factors have to be found (the search space 
becomes too large) and/or when the correction has to be done on-line (i.e., when it depends on 
many continuously varying parameters).

The works in [6] [8] [9] [10] [11] aimed at finding the correction factor using the pdf of the
L-value. The method of [6], based on a Gaussian model of the L-value, fails to capture properties of
non-Gaussian pdfs, while [8] draws general conclusions about the suitability of linear correction
but relies on simulation to find the correction factor. The works [9] [10] [11] rely on the minimization
of a functional of the pdf, which requires numerical integration as in most cases analytical
solutions are not available. The drawback of [9] is that the pdf has to be known or estimated
and the functional in the optimization problem is not related to any performance criterion. These
disadvantages were recently removed in [10] [11], where the correction factor was formally found
via maximization of the so-called generalized mutual information (GMI) between the L-values
and the corresponding bits. Then, even if the pdf is not known, Monte-Carlo integration can
be implemented. While this approach was (experimentally) shown to improve the performance of
BICM receivers operating with capacity-approaching codes, it does not explicitly address the
problem of minimizing the error probability of the optimal maximum-likelihood (ML) decoder.

In this paper we explicitly aim at the minimization of the probability of error in ML decoders,
which results in a novel correction principle and provides a new insight into the correction of the
L-values. Our problem is formulated in the domain of the cumulant generating function (CGF)
of the L-values. As the calculation of the CGF is simpler than finding the pdf, in many cases we will be
able to avoid explicit numerical integration. We find a simple correction principle which says
that the correction factor should be equal to twice the so-called saddlepoint of the CGF,
which is the real argument minimizing the CGF. Finding the saddlepoint requires solving a
simple nonlinear equation whose solution, in many cases, may even be found analytically.

The paper is organized as follows. The definitions and notation are presented in Sec. II and
the new correction principle we propose is explained in Sec. III. A detailed illustration of our
analysis is shown in Sec. IV on an example of correction of the L-values in BICM receivers
operating in the presence of interference.

II. Model 

We consider a scenario where a codeword of $N$ bits $\mathbf{c} = [c_1, c_2, \ldots, c_N]$ is sent over a
binary-input memoryless channel with known transition probability given by the pdf $p_{Y_n|C_n}(y_n|c_n)$.
Upon reception of $y_1, \ldots, y_N$, in order to minimize the probability of detection error, the
decoder decides in favour of the codeword that maximizes the likelihood of the observation, i.e.,

$$ \hat{\mathbf{c}} = \arg\max_{\mathbf{c} \in \mathcal{C}} \prod_{n=1}^{N} p_{Y_n|C_n}(y_n|c_n), \tag{2} $$

where $\mathcal{C}$ is the code (i.e., the set of all codewords).

Using Bayes' formula $p_{Y_n|C_n}(y_n|c_n) = P_{C_n|Y_n}(c_n|y_n)\, p_{Y_n}(y_n) / P_{C_n}(c_n)$ in (1), and knowing that
$P_{C_n|Y_n}(c_n|y_n) = 1 - P_{C_n|Y_n}(1 - c_n|y_n)$, we obtain a useful alternative expression of the a posteriori
probability $P_{C_n|Y_n}(c_n|y_n) = e^{l_n c_n}/(1 + e^{l_n})$. It transforms (2) into the decoding based on the
L-values

$$ \hat{\mathbf{c}} = \arg\max_{\mathbf{c} \in \mathcal{C}} \sum_{n=1}^{N} l_n c_n, \tag{3} $$

where the terms independent of $\mathbf{c}$ were removed from the maximization in (3).

The L-values $l_n$ are modelled as random variables $L_n$ and, if they are calculated exactly as
defined in (1), their pdf satisfies the so-called consistency condition [12, Sec. III]

$$ \log \frac{p_{L_n|C_n}(l|1)}{p_{L_n|C_n}(l|0)} = l. \tag{4} $$

The so-called symmetry condition [12, Sec. III]

$$ p_{L_n|C_n}(l|c) = p_{L_n|C_n}(-l|1 - c) \tag{5} $$

simplifies the analysis and, although it does not have to be always satisfied (it depends on the
conditional pdf $p_{Y_n|C_n}(y_n|c_n)$), it may be forced by a pseudo-random scrambling of the bits $c_n$
prior to modulation, followed by the change of the sign of the L-values $l_n$ if the bit was negated
[13]. Thus, from now on, we assume that this condition is always satisfied.

Rewriting (4) as $p_{L_n|C_n}(l|1)\, e^{-l/2} = e^{l/2}\, p_{L_n|C_n}(l|0)$ and using (5) yields what we call a
consistency-symmetry condition

$$ p_{L_n|C_n}(l|0)\, e^{l/2} = e^{-l/2}\, p_{L_n|C_n}(-l|0). \tag{6} $$

A. Mismatched decoding and correction of L-values 

In practice, the calculation of some L-values via (1) may be inexact because i) the model
$p_{Y_n|C_n}(y_n|c_n)$ is not accurate, ii) its parameters are not well estimated, or iii) the likelihood is
calculated using simplifications introduced to lower the computational effort. In general, these
effects may be represented as if a "mismatched" likelihood $q(y_n, c_n) \neq p_{Y_n|C_n}(y_n|c_n)$ was used
in (1), yielding the "mismatched" L-values [14] [10]

$$ \tilde{l}_n = \log \frac{q(y_n, 1)}{q(y_n, 0)}. \tag{7} $$

If the mismatch is ignored, that is, $\tilde{l}_n$ is falsely assumed to be identical with $l_n$, the receiver
will operate in a suboptimal fashion because $\tilde{l}_n$ cannot be transformed into the likelihood
$p_{Y_n|C_n}(y_n|c_n)$. Nevertheless, if the conditional distribution $p_{\tilde{L}_n|C_n}(l|c)$ of $\tilde{L}_n$ (that models the
mismatched metrics $\tilde{l}_n$) is known, we might calculate a post-processing or "correction" function
[8] [9]

$$ f^{\mathrm{c}}(l) = \log \frac{p_{\tilde{L}_n|C_n}(l|1)}{p_{\tilde{L}_n|C_n}(l|0)}, \tag{8} $$

and then calculate the "corrected" L-value as $\tilde{l}_n^{\mathrm{c}} = f^{\mathrm{c}}(\tilde{l}_n)$.

In general, the effect of the mismatch cannot be eliminated, i.e., $\tilde{l}_n^{\mathrm{c}} \neq l_n$. However, using
$\tilde{l}_n^{\mathrm{c}}$ instead of $\tilde{l}_n$ should improve the performance of the decoder, because $\tilde{l}_n^{\mathrm{c}}$ does represent the
likelihood of the observation $\tilde{l}_n$ conditioned on the bit $c_n$. We also immediately conclude that if
the L-value is matched, i.e., its pdf satisfies (4), no correction is necessary as we obtain $f^{\mathrm{c}}(l) = l$,
that is, $\tilde{l}_n^{\mathrm{c}} = l_n$.

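Example 1: If we assume the Gaussian form of the pdf $p_{\tilde{L}_n|C_n}(l|0) = \Psi(l + \mu; \sigma^2) = p_{\tilde{L}_n|C_n}(-l|1)$,
where

$$ \Psi(l; \sigma^2) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left(-\frac{l^2}{2\sigma^2}\right), \tag{9} $$

$\mu = -E_{\tilde{L}_n|C_n=0}\{\tilde{L}_n\}$ is the negated mean of $\tilde{L}_n$ and
$\sigma^2 = E_{\tilde{L}_n|C_n=0}\{\tilde{L}_n^2\} - E^2_{\tilde{L}_n|C_n=0}\{\tilde{L}_n\}$ is its variance, the correction function

$$ f^{\mathrm{c}}(l) = \frac{2\mu}{\sigma^2} \cdot l = \alpha \cdot l \tag{10} $$

is linear and the correction factor $\alpha = 2\mu/\sigma^2$ is given by twice the ratio of the mean and the
variance of the L-value.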
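
As a rough illustration of Example 1, the following Python sketch estimates the Gaussian-model correction factor, twice the ratio of the negated mean to the variance, from samples of an L-value conditioned on the bit 0. The function names, the sample size, and the chosen means and variances are assumptions made for this illustration only. The sketch relies on the fact that a matched Gaussian L-value has a variance equal to twice its negated mean, so that its factor is close to one.

```python
import random
import statistics


def gaussian_correction_factor(samples_given_zero):
    """Return 2*mu/sigma^2 for L-value samples conditioned on the bit 0.

    mu is the negated sample mean and sigma^2 the sample variance, as in
    the Gaussian model of Example 1 (an assumption about the true pdf).
    """
    mu = -statistics.fmean(samples_given_zero)
    sigma2 = statistics.pvariance(samples_given_zero)
    return 2.0 * mu / sigma2


def draw_gaussian_l_values(mu, sigma2, count, seed=0):
    """Draw hypothetical L-values distributed as N(-mu, sigma2)."""
    rng = random.Random(seed)
    sigma = sigma2 ** 0.5
    return [rng.gauss(-mu, sigma) for _ in range(count)]


if __name__ == "__main__":
    # Matched Gaussian L-values satisfy sigma^2 = 2*mu: the factor is near 1.
    matched = draw_gaussian_l_values(mu=8.0, sigma2=16.0, count=100_000)
    # Hypothetical mismatched L-values with twice the variance: factor near 0.5.
    mismatched = draw_gaussian_l_values(mu=8.0, sigma2=32.0, count=100_000)
    print(gaussian_correction_factor(matched))
    print(gaussian_correction_factor(mismatched))
```

The estimate is only as good as the Gaussian assumption; for non-Gaussian pdfs the resulting factor need not be meaningful.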

The Gaussian model from the above example was used in [6] to justify the correction based
on (10).

The resulting correction $\tilde{l}_n^{\mathrm{c}} = \alpha \cdot \tilde{l}_n$ has an appealing simplicity and in many cases (treated
mostly in the area of iterative decoding) $f^{\mathrm{c}}(l)$ was observed to be relatively well approximated
by a linear function [8] [9] [15]. Therefore, using $f(l) = \alpha l$ instead of $f^{\mathrm{c}}(l)$ (that is, in general,
nonlinear) often provides a satisfactory correction effect [8] [9] [15] and, when compared to
nonlinear functions $f^{\mathrm{c}}(l)$, has the advantage of implementation simplicity (scaling only) and
a relatively simple design (one parameter needs to be found). Then, the main question is how
to choose the correction factor $\alpha$.

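The contributions in [16] [15] attempted to answer this question by making $f(l) = \alpha \cdot l$ "close"
to $f^{\mathrm{c}}(l)$. In particular, [16] finds the correction factor via the weighted least-square fit (WLSF)
to the function $f^{\mathrm{c}}(l)$

$$ \begin{aligned}
\hat{\alpha}^{\mathrm{WLSF}} &= \arg\min_{\alpha} E_{\tilde{L}_n|C_n=0}\left\{ |f^{\mathrm{c}}(\tilde{L}_n) - \alpha \tilde{L}_n|^2 \right\} \\
&= \arg\min_{\alpha} \int_{-\infty}^{\infty} p_{\tilde{L}_n|C_n}(l|0)\, \left(f^{\mathrm{c}}(l) - \alpha l\right)^2 \mathrm{d}l.
\end{aligned} \tag{11} $$

This criterion, however, is not associated with the performance of the decoder. Moreover, since
we use the function $f^{\mathrm{c}}(l)$, the form of the pdf $p_{\tilde{L}_n|C_n}(l|0)$ has to be known or explicitly estimated.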

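In the recent works [10] [11], the correction factor was found through maximization of the
generalized mutual information (GMI) [14] between the mismatched L-values and the
corresponding bits. Assuming (5), this criterion boils down to solving the following optimization
problem

$$ \begin{aligned}
\hat{\alpha}^{\mathrm{GMI}} &= \arg\max_{\alpha} \left\{ 1 - E_{\tilde{L}_n|C_n=0}\left\{ \log_2\left(1 + e^{\alpha \tilde{L}_n}\right) \right\} \right\} \\
&= \arg\min_{\alpha} E_{\tilde{L}_n|C_n=0}\left\{ \log_2\left(1 + e^{\alpha \tilde{L}_n}\right) \right\} \\
&= \arg\min_{\alpha} \int_{-\infty}^{\infty} p_{\tilde{L}_n|C_n}(l|0) \log_2\left(1 + e^{\alpha l}\right) \mathrm{d}l.
\end{aligned} \tag{12} $$

The minimum is reached when the derivative of the integral in (12) goes to zero

$$ \frac{\mathrm{d}}{\mathrm{d}\alpha} E_{\tilde{L}_n|C_n=0}\left\{ \log\left(1 + e^{\alpha \tilde{L}_n}\right) \right\}
= \int_{-\infty}^{\infty} p_{\tilde{L}_n|C_n}(l|0)\, \frac{l\, e^{\alpha l}}{1 + e^{\alpha l}}\, \mathrm{d}l \tag{13} $$

$$ = \frac{1}{2} \int_{-\infty}^{\infty} p_{\tilde{L}_n|C_n}(l|0)\, \frac{l\, e^{\alpha l/2}}{\cosh(\alpha l/2)}\, \mathrm{d}l = 0. \tag{14} $$

While it is argued (and demonstrated on examples) in [10] [11] that the maximization of GMI
should improve the performance of capacity-approaching codes, the correction criterion (12)
does not relate directly to the performance of the ML decoder we are interested in.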

Moreover, solving (12) requires numerical quadratures, as the logarithm within (12) or
the hyperbolic cosine within (14) will resist analytical integration. If $p_{\tilde{L}_n|C_n}(l|0)$ is not known,
Monte-Carlo integration (simulations) may be used to calculate the integral in (12) or (14),
but such an approach precludes the on-line (i.e., model-based) correction. However, it is still
simpler to implement than (11) because we do not need to know the function $f^{\mathrm{c}}(l)$.
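
The Monte-Carlo variant mentioned above can be sketched as follows in Python. This is a minimal illustration, not the procedure of [10] [11]: the function names, the bisection bracket, and the Gaussian test samples are assumptions. The sketch replaces the expectation in (12) by a sample mean and searches for the zero of its derivative; bisection applies because the sample objective is convex in the correction factor.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function e^x / (1 + e^x)."""
    if x >= 0.0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def gmi_derivative(alpha, samples):
    """Sample mean of l * e^{alpha*l} / (1 + e^{alpha*l}).

    Up to the positive factor 1/ln(2), this is the derivative with respect
    to alpha of the Monte-Carlo estimate of E{log2(1 + e^{alpha*L})}.
    """
    return sum(l * sigmoid(alpha * l) for l in samples) / len(samples)


def gmi_correction_factor(samples, lo=1e-6, hi=10.0, iterations=40):
    """Bisection for the zero of the derivative; [lo, hi] is an assumed bracket."""
    if gmi_derivative(lo, samples) >= 0.0 or gmi_derivative(hi, samples) <= 0.0:
        raise ValueError("the bracket [lo, hi] does not contain the minimum")
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if gmi_derivative(mid, samples) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    rng = random.Random(1)
    # Hypothetical mismatched L-values given bit 0: Gaussian, mean -8, variance 32.
    samples = [rng.gauss(-8.0, math.sqrt(32.0)) for _ in range(50_000)]
    print(gmi_correction_factor(samples))
```

For these Gaussian samples the exact correction function is linear with slope 0.5, so the estimate should land close to that value, up to Monte-Carlo error.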

B. PEP 

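To describe the behaviour of the ML decoder (3) based on the corrected L-values

$$ \hat{\mathbf{c}} = \arg\max_{\mathbf{c} \in \mathcal{C}} \sum_{n=1}^{N} \alpha_n \cdot \tilde{l}_n \cdot c_n, \tag{15} $$

we will use the pairwise error probability (PEP), defined as the probability of detecting the codeword
$\hat{\mathbf{c}}$ when sending the codeword $\mathbf{c}$.

Assuming that the code $\mathcal{C}$ is linear and (5) holds, instead of calculating the PEP for all
pairs $(\mathbf{c}, \hat{\mathbf{c}})$ it is enough to calculate the PEP for all $\hat{\mathbf{c}} \neq \mathbf{c}$ assuming the all-zeros codeword
$\mathbf{c} = [0, \ldots, 0]$ was sent, that is, the probability of the event $\mathbf{c} \to \hat{\mathbf{c}}$

\begin{align}
\mathrm{PEP}(\{\alpha_n\}_{n=1}^{N}, \hat{\mathbf{c}})
&= \Pr\left\{ \sum_{n=1}^{N} \alpha_n \cdot \tilde{L}_n \cdot \hat{c}_n > \sum_{n=1}^{N} \alpha_n \cdot \tilde{L}_n \cdot c_n \right\} \tag{16} \\
&= \Pr\left\{ \sum_{n=1}^{N} \tilde{L}_n^{\mathrm{c}}\, \hat{c}_n > 0 \right\} \tag{17} \\
&= \int_{0}^{\infty} \left[p_{\tilde{L}_1^{\mathrm{c}}|C_1}(l|0)\right]^{\hat{c}_1} * \cdots * \left[p_{\tilde{L}_N^{\mathrm{c}}|C_N}(l|0)\right]^{\hat{c}_N} \mathrm{d}l, \tag{18}
\end{align}

where $*$ is the convolution operator.

This notation emphasizes that the PEP depends on the correction factors $\{\alpha_n\}_{n=1}^{N}$ and the
codeword $\hat{\mathbf{c}}$.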

If we denote by $\{n_k\}_{k=1}^{d}$ the set of indices such that $\hat{c}_{n_k} = 1$, where $d$ is the Hamming weight
of $\hat{\mathbf{c}}$, the PEP (18) can be written as

$$ \mathrm{PEP}(\{\alpha_n\}_{n=1}^{N}, \hat{\mathbf{c}}) = \int_{0}^{\infty} p_{\tilde{L}_{n_1}^{\mathrm{c}}|C_{n_1}}(l|0) * \cdots * p_{\tilde{L}_{n_d}^{\mathrm{c}}|C_{n_d}}(l|0)\, \mathrm{d}l, \tag{19} $$

that is, it depends solely on the pdfs of the L-values indexed by $\{n_k\}_{k=1}^{d}$.

We also note that multiplying all the L-values $\tilde{l}_n$ in (15) by $\alpha_n = \alpha$ cannot change
the decoding results so, in such a case, the linear correction is useless if the ML decoder is used.
However, it may still be useful if another type of decoding is applied. For example, iterative
decoders (e.g., of the turbo codes or LDPC codes) may benefit from such a correction.
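
The invariance of the ML decision to a common scaling can be checked with a small brute-force decoder. The sketch below is only an illustration: the toy single-parity-check code, the L-values, and the scaling factors are hypothetical choices, and exhaustive search stands in for a practical decoder.

```python
from itertools import product


def ml_decode(l_values, codebook):
    """Return the codeword maximizing sum_n l_n * c_n, as in (3)."""
    return max(codebook, key=lambda c: sum(l * bit for l, bit in zip(l_values, c)))


# Hypothetical toy code: all length-3 words with an even number of ones.
CODEBOOK = [c for c in product((0, 1), repeat=3) if sum(c) % 2 == 0]

if __name__ == "__main__":
    l_values = [1.0, -0.8, -0.9]
    # The same positive factor applied to every L-value.
    common = [0.25 * l for l in l_values]
    # A different factor per L-value (only the third one is scaled down).
    unequal = [a * l for a, l in zip((1.0, 1.0, 0.1), l_values)]
    print(ml_decode(l_values, CODEBOOK))
    print(ml_decode(common, CODEBOOK))
    print(ml_decode(unequal, CODEBOOK))
```

The first two decisions coincide, while the unequal scaling changes the decision, which is why only correction factors that differ across the L-values can affect the ML decoder.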
III. PEP-Minimizing Correction

Now, we want to answer the question: how to choose the correction factors $\{\alpha_n\}_{n=1}^{N}$ so that the
error of the decoder that uses the corrected L-values $\tilde{l}_n^{\mathrm{c}} = \alpha_n \cdot \tilde{l}_n$ is minimized?

From the previous discussion we conclude that, in order to improve the performance of the
decoder, we should find $\{\alpha_n\}_{n=1}^{N}$ that minimize the PEP $(\{\alpha_n\}_{n=1}^{N}, \hat{\mathbf{c}})$ in (18) for any codeword
$\hat{\mathbf{c}}$. Thus, we have to solve the following optimization problem

$$ \{\hat{\alpha}_n\}_{n=1}^{N} = \arg\min_{\{\alpha_n\}_{n=1}^{N}} \mathrm{PEP}(\{\alpha_n\}_{n=1}^{N}, \hat{\mathbf{c}}), \tag{20} $$

and its solution should be independent of $\hat{\mathbf{c}}$ because we want to apply the correction factors to
all L-values prior to decoding and we do not know which error ($\mathbf{c} \to \hat{\mathbf{c}}$) will occur.

At first sight, the problem may appear intractable due to the dependence of the PEP on the various
$\hat{\mathbf{c}}$, each resulting in a different set of L-values indexed by $\{n_k\}_{k=1}^{d}$, which are then convolved as
per (19).

A. Two-state mismatch 

Before attacking the problem (20) we will analyze a simpler case of a two-state mismatch,
where $N_1$ of the L-values are independent identically distributed (i.i.d.) and mismatched and
the remaining $N - N_1$ are i.i.d. and matched. In this case, all the mismatched L-values will be
multiplied by the same correction factor $\alpha$, that is, $\alpha_n = \alpha$, $n = 1, \ldots, N_1$, and the matched
L-values will remain unaltered.

Since we do not know a priori the indices $\{n_k\}_{k=1}^{d}$, we do not know a priori how many
mismatched metrics will affect the PEP. We thus assume initially that among the L-values
affecting the PEP calculation, $d_1$ are mismatched and $d_2$ are matched. This specifies
(19) as follows

$$ \begin{aligned}
\mathrm{PEP}(\alpha) &= \Pr\left\{ \sum_{k=1}^{d_1} \tilde{L}_k^{\mathrm{c}} + \sum_{k=1}^{d_2} L_k > 0 \right\} \\
&= \int_{0}^{\infty} \left[p_{\tilde{L}^{\mathrm{c}}|C}(l|0)\right]^{*d_1} * \left[p_{L|C}(l|0)\right]^{*d_2} \mathrm{d}l,
\end{aligned} \tag{21} $$

where $[f(l)]^{*d}$ is a $d$-fold self-convolution of $f(l)$ and we emphasize that the PEP depends
on a single parameter $\alpha$.

We want to minimize $\mathrm{PEP}(\alpha)$ in (21) for any $d_1$ and $d_2$; thus, the solution of

$$ \hat{\alpha} = \arg\min_{\alpha} \mathrm{PEP}(\alpha) \tag{22} $$

should be independent of $d_1$ and $d_2$.

Example 2: Assume that the bits $c_n$ are sent using binary phase-shift keying (BPSK)
modulation so

$$ p_{Y_n|C_n}(y_n|c_n) = \Psi\left(y_n - (2c_n - 1); \frac{1}{2\gamma}\right), \tag{23} $$

where $\gamma$ has the meaning of the average signal-to-noise ratio (SNR) and $\Psi(\cdot)$ is given by (9).

To calculate the L-values via (1) using (23) we need to know the value of $\gamma$ and we assume
that its estimate $\hat{\gamma} \neq \gamma$ is used for the first $N_1$ L-values as $\tilde{l}_n = 4 y_n \cdot \hat{\gamma}$, $n = 1, \ldots, N_1$, so these
L-values are mismatched. The exact value of $\gamma$ is used for the remaining L-values $l_n = 4 y_n \cdot \gamma$,
$n = N_1 + 1, \ldots, N$, and these L-values are matched. It is straightforward to see that the pdf of the
mismatched L-values is given by $p_{\tilde{L}_n|C_n}(l|0) = \Psi(l + 4\hat{\gamma}; 8\hat{\gamma}^2/\gamma)$ while the pdf of the matched
L-values is given by $p_{L_n|C_n}(l|0) = \Psi(l + 4\gamma; 8\gamma)$ [17].

Since all the L-values affecting the PEP are Gaussian, the result of their convolution is also
Gaussian and we can write (21) as

$$ \mathrm{PEP}(\alpha) = Q\left( \frac{d_1 \alpha\, 4\hat{\gamma} + d_2\, 4\gamma}{\sqrt{d_1 \alpha^2\, 8\hat{\gamma}^2/\gamma + d_2\, 8\gamma}} \right)
= Q\left( \sqrt{2\gamma}\, \frac{d_1 \alpha \hat{\gamma} + d_2 \gamma}{\sqrt{d_1 \alpha^2 \hat{\gamma}^2 + d_2 \gamma^2}} \right), \tag{24} $$

where $Q(x) = \frac{1}{\sqrt{2\pi}} \int_{x}^{\infty} \exp(-t^2/2)\, \mathrm{d}t$.

Verifying that (24) is convex with respect to $\alpha$ and setting its derivative to zero yields the
global minimum of (22) given by $\hat{\alpha} = \gamma/\hat{\gamma}$.

Note that, as required, the correction factor is independent of $d_1$ and $d_2$; thus the PEP is
minimized independently of the error event $\mathbf{c} \to \hat{\mathbf{c}}$.

We can also immediately see that $\tilde{l}_n^{\mathrm{c}} = \hat{\alpha} \cdot \tilde{l}_n = \tilde{l}_n \gamma/\hat{\gamma} = 4 y_n \gamma$, that is, $\tilde{l}_n^{\mathrm{c}} = l_n$ and we
recover the matched L-value. Of course, if we knew that $\gamma$ should be used, we would not use
$\hat{\gamma}$ to calculate the L-values in the first place, so this example illustrates only the principle of
correction.
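
The claim of Example 2 can be checked numerically. The Python sketch below evaluates the Gaussian PEP of (24) on a grid and returns the minimizing factor; the function names, the grid limits, and the values of the SNR and of its estimate are assumptions made for illustration, and a grid search replaces the analytical minimization.

```python
import math


def q_function(x):
    """Gaussian tail probability Q(x)."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def pep_two_state(alpha, d1, d2, gamma, gamma_hat):
    """PEP of (24): d1 scaled mismatched and d2 matched Gaussian L-values."""
    mean = d1 * alpha * 4.0 * gamma_hat + d2 * 4.0 * gamma
    var = d1 * alpha ** 2 * 8.0 * gamma_hat ** 2 / gamma + d2 * 8.0 * gamma
    return q_function(mean / math.sqrt(var))


def best_alpha(d1, d2, gamma, gamma_hat, step=1e-3, upper=3.0):
    """Grid search for the PEP-minimizing alpha (grid limits are assumptions)."""
    grid = [k * step for k in range(1, int(upper / step) + 1)]
    return min(grid, key=lambda a: pep_two_state(a, d1, d2, gamma, gamma_hat))


if __name__ == "__main__":
    gamma, gamma_hat = 1.0, 2.0  # hypothetical true SNR and its estimate
    for d1, d2 in [(1, 1), (2, 5), (7, 3)]:
        print(d1, d2, best_alpha(d1, d2, gamma, gamma_hat))
```

For every pair of weights the grid minimum should sit at the ratio of the true SNR to its estimate, up to the grid resolution.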

B. Approximation of the PEP 

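To apply the PEP-minimization principle (22) in a general case, we must be able to find the
PEP for arbitrary distributions of the L-values. Since, in general, this cannot be done exactly in
an analytical form, we will turn to approximations.

Defining

$$ L^{\Sigma} = \sum_{k=1}^{d_1} \tilde{L}_k^{\mathrm{c}} + \sum_{k=1}^{d_2} L_k = \sum_{k=1}^{d_1} \alpha \tilde{L}_k + \sum_{k=1}^{d_2} L_k, \tag{25} $$

we can write (21) as $\mathrm{PEP}(\alpha) = \Pr\{L^{\Sigma} > 0\}$.

Then, the Bhattacharyya upper bound for the PEP is given by [13] [18]

$$ \mathrm{PEP}(\alpha) \leq \mathrm{PEP}^{\mathrm{UB}}(\alpha) = e^{\kappa_{\Sigma}(\hat{s})}, \tag{26} $$

where $\kappa_{\Sigma}(s)$ is the cumulant generating function (CGF) of $L^{\Sigma}$

$$ \kappa_{\Sigma}(s) = d_1 \kappa_{\tilde{L}^{\mathrm{c}}}(s) + d_2 \kappa_{L}(s) = d_1 \kappa_{\tilde{L}}(s\alpha) + d_2 \kappa_{L}(s), \tag{27} $$

and

$$ \kappa_{L}(s) = \log E_{L|C=0}\{e^{sL}\} = \log \int_{-\infty}^{\infty} p_{L|C}(l|0)\, e^{sl}\, \mathrm{d}l, \tag{28} $$

$$ \kappa_{\tilde{L}}(s) = \log E_{\tilde{L}|C=0}\{e^{s\tilde{L}}\}, \tag{29} $$

$$ \kappa_{\tilde{L}^{\mathrm{c}}}(s) = \log E_{\tilde{L}|C=0}\{e^{s\alpha\tilde{L}}\} = \kappa_{\tilde{L}}(s\alpha). \tag{30} $$

In (26), $\hat{s} = \arg\min_{s \in \mathbb{R}} \kappa_{\Sigma}(s)$ is the so-called saddlepoint [18] of $\kappa_{\Sigma}(s)$, which is unique
because the CGF is always convex.

The bound (26) was shown in [ ] to be quite loose and a much more accurate estimation of
the PEP can be obtained using the so-called saddlepoint approximation (SPA) [18] [19] [20]

$$ \mathrm{PEP}(\alpha) \approx \widetilde{\mathrm{PEP}}(\alpha) = \frac{e^{\kappa_{\Sigma}(\hat{s})}}{\hat{s}\sqrt{2\pi \kappa''_{\Sigma}(\hat{s})}}. \tag{31} $$

However, minimization of (31) is quite difficult due to the implicit dependence of $\hat{s}$ on $\alpha$;
therefore, for simplicity we opt for minimization of the upper bound (26). Nevertheless, even if
the correction factors $\alpha$ minimizing $\mathrm{PEP}^{\mathrm{UB}}(\alpha)$ and $\widetilde{\mathrm{PEP}}(\alpha)$ are not the same, we expect
them to be similar as the exponential term $e^{\kappa_{\Sigma}(\hat{s})}$ dominates both expressions.

Theorem 1: The upper bound for the PEP in (26) is minimized by setting the correction factor
to $\hat{\alpha} = \hat{s}_1/\hat{s}_2$, where $\hat{s}_1$ and $\hat{s}_2$ are the saddlepoints of the CGFs of the mismatched and matched
L-values, respectively, that is, $\kappa'_{\tilde{L}}(\hat{s}_1) = 0$ and $\kappa'_{L}(\hat{s}_2) = 0$.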

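Proof: We start by bounding (26) as

\begin{align}
\min_{\alpha} \mathrm{PEP}^{\mathrm{UB}}(\alpha) &= e^{\min_{\alpha} \kappa_{\Sigma}(\hat{s})} = e^{\min_{\alpha, s} \left( d_1 \kappa_{\tilde{L}}(s\alpha) + d_2 \kappa_{L}(s) \right)} \tag{32} \\
&\geq e^{d_1 \kappa_{\tilde{L}}(\hat{s}_1) + d_2 \kappa_{L}(\hat{s}_2)}, \tag{33}
\end{align}

where (33) is the global minimum of (26).

We can see that the exponent in (32), $\min_{\alpha, s} \left( d_1 \kappa_{\tilde{L}}(s\alpha) + d_2 \kappa_{L}(s) \right)$, reaches its global minimum
for $s\alpha = \hat{s}_1$ and $s = \hat{s}_2$, that is, when $\alpha = \hat{s}_1/\hat{s}_2$; this means that the saddlepoint of $\kappa_{\Sigma}(s)$ is
$\hat{s} = \hat{s}_2$. ■

As required, the bound on the PEP is minimized independently of $d_1$ and $d_2$.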
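
As a consistency check, the rule of Theorem 1 (the ratio of the saddlepoint of the mismatched CGF to that of the matched CGF) can be applied to the Gaussian L-values of Example 2. The derivation below is a sketch that uses only the standard fact that a Gaussian variable with mean $m$ and variance $v$ has the CGF $ms + vs^2/2$; the symbols follow Example 2, with $\hat{\gamma}$ denoting the SNR estimate.

```latex
\begin{align*}
\kappa_{\tilde{L}}(s) &= -4\hat{\gamma}\, s + \frac{4\hat{\gamma}^{2}}{\gamma}\, s^{2},
  & \kappa'_{\tilde{L}}(\hat{s}_1) = 0 &\;\Rightarrow\; \hat{s}_1 = \frac{\gamma}{2\hat{\gamma}}, \\
\kappa_{L}(s) &= -4\gamma\, s + 4\gamma\, s^{2},
  & \kappa'_{L}(\hat{s}_2) = 0 &\;\Rightarrow\; \hat{s}_2 = \frac{1}{2}, \\
\hat{\alpha} &= \frac{\hat{s}_1}{\hat{s}_2} = \frac{\gamma}{\hat{\gamma}}.
\end{align*}
```

The result agrees with the factor obtained in Example 2 by direct minimization of the PEP.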

C. Arbitrary mismatch 

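We are now ready to abandon the context of the two-state mismatch and extend the
previous result to the case treated in (20).

Let $L^{\Sigma} = \sum_{n=1}^{N} \alpha_n \cdot \tilde{L}_n \cdot \hat{c}_n$ have the CGF given by $\kappa_{\Sigma}(s) = \sum_{n=1}^{N} \kappa_n(s\alpha_n) \cdot \hat{c}_n$, where $\kappa_n(s)$
is the CGF of the L-value $\tilde{L}_n$ conditioned on $C_n = 0$. Define the upper bound on the PEP (20)
as

$$ \mathrm{PEP}(\{\alpha_n\}_{n=1}^{N}; \hat{\mathbf{c}}) \leq \mathrm{PEP}^{\mathrm{UB}}(\{\alpha_n\}_{n=1}^{N}; \hat{\mathbf{c}}) = e^{\kappa_{\Sigma}(\hat{s})}. \tag{34} $$

Proposition 1: The linear correction factors that minimize the upper bound on the PEP (34) are
given by $\hat{\alpha}_n = \hat{s}_n/s_0$, where $\hat{s}_n$ is the saddlepoint of the CGF $\kappa_n(s)$ and $s_0 > 0$ is chosen arbitrarily.

Proof: As in the proof of Theorem 1 we write

\begin{align}
\min_{\{\alpha_n\}_{n=1}^{N}} \mathrm{PEP}^{\mathrm{UB}}(\{\alpha_n\}_{n=1}^{N}; \hat{\mathbf{c}}) &= e^{\min_{s, \{\alpha_n\}} \sum_{n=1}^{N} \kappa_n(s\alpha_n) \cdot \hat{c}_n} \tag{35} \\
&\geq e^{\sum_{n=1}^{N} \kappa_n(\hat{s}_n) \cdot \hat{c}_n}. \tag{36}
\end{align}

The global minimum is reached when $\hat{s}_n = s\alpha_n$, $n = 1, \ldots, N$. This produces $N$ equations with
$N + 1$ variables and, since $s$ and $\hat{s}_1, \ldots, \hat{s}_N$ are positive, we may arbitrarily fix $s = s_0$. ■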
Remark 1: Although $s_0$ may be chosen arbitrarily (recall that the multiplication of all
L-values by the same correction factor does not change the ML decoding results), it is reasonable
to use $s_0 = \frac{1}{2}$. This is because the saddlepoint of the matched metrics equals $\hat{s}_n = \frac{1}{2}$ [18] and
their correction factor is then given by $\hat{\alpha} = 1$, that is, no correction is necessary, as we would
expect. Thus, the simple rule consists in doubling the saddlepoint of the CGF of the L-values

$$ \hat{\alpha}_n = 2\hat{s}_n. \tag{37} $$

Remark 2: We recall that if we want to use the pdf conditioned on $C_n = 1$, $p_{\tilde{L}_n|C_n}(l|1)$,
instead of $p_{\tilde{L}_n|C_n}(l|0)$, the saddlepoint is negative, $\hat{s}_n < 0$, but then also for the matched metrics
$s_0 = -\frac{1}{2}$. Thus, to take both cases into account we might reformulate (37) as

$$ \hat{\alpha}_n = 2|\hat{s}_n|. \tag{38} $$

Remark 3: Since the CGF of the L-values (after correction) is equal to $\kappa_{\tilde{L}_n^{\mathrm{c}}}(s) = \kappa_{\tilde{L}_n}(s\hat{\alpha}_n)$,
the saddlepoint of $\kappa_{\tilde{L}_n^{\mathrm{c}}}(s)$ is given by $\hat{s}_n/\hat{\alpha}_n = s_0$. That is, the saddlepoint of the CGF of each
corrected L-value is equal to $s_0 = \frac{1}{2}$.
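
When no closed-form CGF is available, the doubling rule (37) could also be applied to an empirical CGF. The Python sketch below is such a sample-based variant and is not part of the method described above, which targets model-based calculation; the function names, the bisection bracket, and the Gaussian test samples are assumptions. It finds the root of the derivative of the empirical CGF, whose sign equals that of the sample mean of $l e^{sl}$, and doubles it.

```python
import math
import random


def tilted_mean(s, samples):
    """Sample estimate of E{L e^{sL}}; it has the sign of the CGF derivative."""
    return sum(l * math.exp(s * l) for l in samples) / len(samples)


def saddlepoint(samples, lo=0.0, hi=2.0, iterations=30):
    """Bisection for the root of the CGF derivative on an assumed bracket."""
    if tilted_mean(lo, samples) >= 0.0 or tilted_mean(hi, samples) <= 0.0:
        raise ValueError("the bracket [lo, hi] does not contain the saddlepoint")
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if tilted_mean(mid, samples) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def correction_factor(samples):
    """Rule (37): twice the saddlepoint of the CGF of the L-values."""
    return 2.0 * saddlepoint(samples)


if __name__ == "__main__":
    rng = random.Random(2)
    # Hypothetical L-values given bit 0 (mean -2); variance 4 is matched.
    matched = [rng.gauss(-2.0, 2.0) for _ in range(100_000)]
    mismatched = [rng.gauss(-2.0, math.sqrt(8.0)) for _ in range(100_000)]
    print(correction_factor(matched))
    print(correction_factor(mismatched))
```

For the matched samples the factor should be close to one, in line with Remark 1. The exponential weighting makes the estimate noisy when the L-values have a large spread, so the sample size matters.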

D. Relationship to the GMI -maximizing correction 

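Let us now compare the correction factor $\hat{\alpha}$ defined using (37) to $\hat{\alpha}^{\mathrm{GMI}}$ defined in (14), where
the maximization is obtained by finding the zero of the derivative

$$ \int_{-\infty}^{\infty} p_{\tilde{L}|C}(l|0)\, \frac{l\, e^{\alpha l/2}}{\cosh(\alpha l/2)}\, \mathrm{d}l = 0, \tag{39} $$

$$ \int_{-\infty}^{\infty} p_{\tilde{L}^{\mathrm{c}}|C}(l|0)\, \frac{l\, e^{l/2}}{\cosh(l/2)}\, \mathrm{d}l = 0, \tag{40} $$

where (40) is obtained from (39) after the change of variables using the pdf of the corrected
L-value $\tilde{L}^{\mathrm{c}} = \alpha\tilde{L}$, i.e., $p_{\tilde{L}^{\mathrm{c}}|C}(l|0) = \frac{1}{\alpha}\, p_{\tilde{L}|C}(l/\alpha|0)$.

On the other hand, the condition we derived in (37) states that the saddlepoint of $\kappa_{\tilde{L}^{\mathrm{c}}}(s)$
should be equal to $\frac{1}{2}$, i.e., $\kappa'_{\tilde{L}^{\mathrm{c}}}(\frac{1}{2}) = 0$, which may be written as follows

$$ \int_{-\infty}^{\infty} p_{\tilde{L}^{\mathrm{c}}|C}(l|0)\, l\, e^{l/2}\, \mathrm{d}l = 0. \tag{42} $$

Since $\cosh(l/2)$ in the denominator of (40) is symmetric, we can see that if $p_{\tilde{L}^{\mathrm{c}}|C}(l|0)\, e^{l/2}$ is
symmetric, both (40) and (42) are satisfied. This is also the condition we derived in (6) and it can
be interpreted as follows: if the optimal correction function $f^{\mathrm{c}}(l)$ is linear, i.e., after the linear
correction the L-value satisfies the consistency-symmetry condition (6), both criteria yield the
same correction factor. In general, however, they need not be the same.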

IV. Example of application: Correcting the interference effects 

Consider a BPSK transmission, where the sent symbols $x_n = 2c_n - 1$ pass through a channel
affected by additive Gaussian noise and BPSK interference

$$ y_n = h_n \cdot x_n + z_n + g_n \cdot d_n, \tag{43} $$

where $h_n \in \mathbb{R}$ is the known gain of the channel, $z_n$ is a zero-mean Gaussian signal with known
variance $\sigma_z^2 = N_0/2$; $d_n \in \{-1, 1\}$ is the BPSK-modulated interference signal received with the
gain $g_n \in \mathbb{R}$. We define also the SNR and the signal-to-interference ratio (SIR), respectively, as
$\mathrm{SNR} = h_n^2/N_0$ and $\mathrm{SIR} = h_n^2/g_n^2$.

Although using (1) it is relatively simple to calculate the L-values in this case as

$$ l_n = \log \frac{\exp\left(-\frac{(y_n - h_n - g_n)^2}{2\sigma_z^2}\right) + \exp\left(-\frac{(y_n - h_n + g_n)^2}{2\sigma_z^2}\right)}{\exp\left(-\frac{(y_n + h_n - g_n)^2}{2\sigma_z^2}\right) + \exp\left(-\frac{(y_n + h_n + g_n)^2}{2\sigma_z^2}\right)}, \tag{44} $$

for the purpose of our examples, the receiver ignores the interference, thus assumes $g_n = 0$, and
then, from (44) we obtain the L-value

$$ \tilde{l}_n = \frac{2 h_n}{\sigma_z^2} \cdot y_n, \tag{45} $$

which is mismatched due to the assumed absence of interference.

To apply the correction principle we derived, we need to calculate the CGF of $\tilde{L} = \frac{2 h_n}{\sigma_z^2} \cdot Y$,
conditioned on sending the bit $C_n = 0$, $\kappa_{\tilde{L}}(s) = \kappa_{Y}\left(\frac{2 h_n}{\sigma_z^2} s\right)$, where

$$ \kappa_{Y}(s) = -h_n s + \frac{1}{2} s^2 \sigma_z^2 + \log\left( \frac{1}{2} e^{g_n s} + \frac{1}{2} e^{-g_n s} \right). \tag{46} $$

Then, finding the saddlepoint $\hat{s}$ of $\kappa_{Y}(s)$ from $\kappa'_{Y}(\hat{s}) = 0$, the saddlepoint of $\kappa_{\tilde{L}}(s)$ equals $\frac{\sigma_z^2}{2 h_n}\hat{s}$;
thus, the correction factor applied to $\tilde{l}_n$ is given by (37) as $\hat{\alpha} = \frac{\sigma_z^2}{h_n}\hat{s}$ and the correction boils
down to the calculation

$$ \tilde{l}_n^{\mathrm{c}} = 2\hat{s} \cdot y_n. \tag{47} $$

Note that, instead of calculating the saddlepoint of $\kappa_{\tilde{L}}(s)$ and correcting $\tilde{l}_n$, we might directly
calculate the saddlepoint of $\kappa_{Y}(s)$ and apply it to the observation $y_n$. This is possible, of course,
because the L-value $\tilde{l}_n$ is a scaled version of $y_n$, cf. (45).

To find the saddlepoint $\hat{s}$ we differentiate (46) to obtain the following saddlepoint equation

$$ h_n - \sigma_z^2 \cdot s = g_n \cdot \tanh(g_n \cdot s), \tag{48} $$

whose graphical interpretation as the intersection of the right-hand and left-hand sides is shown
in Fig. 1.

While (48) cannot be solved in closed form, we may obtain approximations in particular
cases. Namely, for $\mathrm{SNR} \to 0$ (i.e., when $\sigma_z^2 \to \infty$), we easily see that $\hat{s} \to 0$ so, using the
linearization $\tanh(x) \approx x$ (shown as a dashed line in Fig. 1), we obtain

$$ \hat{s}^{0} = \frac{h_n}{\sigma_z^2 + g_n^2} = \frac{h_n}{\sigma_{N+I,n}^2}, \tag{49} $$

where $\sigma_{N+I,n}^2 = \sigma_z^2 + g_n^2$ is the variance of the noise and interference. The corresponding correction factor
is given by $\hat{\alpha}^{0} = \sigma_z^2/\sigma_{N+I,n}^2$. Note that using $\mu = -E_{\tilde{L}_n|C_n=0}\{\tilde{L}_n\} = 2h_n^2/\sigma_z^2$ and
$\sigma^2 = \mathrm{Var}\{\tilde{L}_n\} = 4h_n^2\sigma_{N+I,n}^2/\sigma_z^4$ in (10) yields exactly the same result, $\sigma_z^2/\sigma_{N+I,n}^2$.

This means that for low SNR, when the noise "dominates" the interference, the effect of the
noise and the interference can be modelled as a Gaussian variable with variance $\sigma_{N+I,n}^2$; this is
often done in practice [21] and the corrected L-values, in this case, would be calculated as

$$ \tilde{l}_n^{\mathrm{c}} = \frac{2 h_n \cdot y_n}{\sigma_{N+I,n}^2}. \tag{50} $$

December 1, 2011 DRAFT

Figure 2. Correction factors $\hat{\alpha}^{\mathrm{GMI}}$, $\hat{\alpha}$, and $\hat{\alpha}^{\mathrm{WLSF}}$ obtained for various values of SNR and SIR. The dotted lines show the value of
$\hat{\alpha}^{\mathrm{2SM}}$, cf. (58), for various combinations of $2 \leq d_1, d_2 \leq 8$.

For high SNR, i.e., when $\mathrm{SNR} \to \infty$ (i.e., $\sigma_z^2 \to 0$), we observe that $\hat{s} \to \infty$ so, taking
advantage in (48) of the saturation $\tanh(\infty) = 1$, the saddlepoint is given by¹

$$ \hat{s}^{\infty} = \frac{h_n - g_n}{\sigma_z^2} \tag{51} $$

and the corresponding correction factor by

$$ \hat{\alpha}^{\infty} = 1 - g_n/h_n. \tag{52} $$

¹ We assume $g_n < h_n$, i.e., the interference is weaker than the desired signal.

The L-values in this case would be calculated as

$$ \tilde{l}_n^{\mathrm{c}} = \frac{2 (h_n - g_n) \cdot y_n}{\sigma_z^2}, \tag{53} $$

that is, the interference-related term decreases the gain of the useful signal. Intuitively, this can
be explained as follows: for high SNR, the interference can be "distinguished" from the noise
and becomes part of the transmitted constellation, i.e., sending the bit $c_n = 1$, we will effectively
be able to distinguish between $h_n - g_n$ and $h_n + g_n$. Moreover, for high SNR the symbol
that is the most likely to provoke an error is the one closest to the origin, that is, $h_n - g_n$. This
leads to assuming that BPSK symbols are sent over a channel with gain $h_n - g_n$, cf. (53).

For $0 < \mathrm{SNR} < \infty$, the saddlepoint can be obtained by numerically solving (48). For example,
we might use the Newton-Raphson method

$$ \hat{s}(i) = \hat{s}(i - 1) - \frac{\kappa'_{Y}(\hat{s}(i - 1))}{\kappa''_{Y}(\hat{s}(i - 1))}, \quad i = 1, 2, \ldots, i_{\max}, \tag{54} $$

where

$$ \kappa''_{Y}(s) = \sigma_z^2 + \frac{g_n^2}{\cosh^2(g_n \cdot s)} \tag{55} $$

and $\hat{s}(0)$ is the appropriately chosen starting point for the recursion, e.g., $\hat{s}(0) = \max\{\hat{s}^{\infty}, \hat{s}^{0}\}$. In
this work, we used $i_{\max} = 2$, so a small computational load is incurred due to the on-line calculation
of the correction factors; alternatively, the correction factors might be interpolated using a
table precalculated for different values of SNR and SIR.
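
A minimal Python sketch of the Newton-Raphson recursion (54) is given below. It assumes that the derivative of (46) reads $-h_n + \sigma_z^2 s + g_n \tanh(g_n s)$, which is equivalent to the saddlepoint equation (48), and that the interference is weaker than the desired signal; the function names and the numerical values in the usage part are hypothetical.

```python
import math


def kappa_y_prime(s, h, g, sigma2):
    """First derivative of the CGF (46): -h + sigma2*s + g*tanh(g*s)."""
    return -h + sigma2 * s + g * math.tanh(g * s)


def kappa_y_second(s, g, sigma2):
    """Second derivative (55); 1 - tanh^2 replaces 1/cosh^2 to avoid overflow."""
    return sigma2 + g * g * (1.0 - math.tanh(g * s) ** 2)


def saddlepoint(h, g, sigma2, i_max=2):
    """Newton-Raphson recursion (54) started from max(s_inf, s_0)."""
    s_inf = (h - g) / sigma2      # high-SNR approximation (51)
    s_0 = h / (sigma2 + g * g)    # low-SNR approximation (49)
    s = max(s_inf, s_0)
    for _ in range(i_max):
        s -= kappa_y_prime(s, h, g, sigma2) / kappa_y_second(s, g, sigma2)
    return s


def correction_factor(h, g, sigma2, i_max=2):
    """alpha = sigma2 * s / h, i.e., twice the saddlepoint of the L-value CGF."""
    return sigma2 * saddlepoint(h, g, sigma2, i_max) / h


if __name__ == "__main__":
    h, g = 1.0, 0.5  # hypothetical gains, i.e., an SIR of about 6 dB
    for sigma2 in (100.0, 1.0, 0.1, 1e-3):
        print(sigma2, correction_factor(h, g, sigma2))
```

Both starting points lie at or below the root, and the derivative of the CGF is increasing and concave for positive arguments, so the iterates approach the root from below without overshooting.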

Since $h_n/\sigma_z^2 > \hat{s}$ (cf. Fig. 1) we can also immediately conclude that the correction factor always
satisfies $\hat{\alpha} = \hat{s}\,\sigma_z^2/h_n < 1$, that is, ignoring the interference, our reliability metric is too "optimistic"
and must be scaled down. On the other hand, since $\hat{s} > \hat{s}^{0}$, if the mismatched L-value is calculated
using the Gaussian approximation of the interference, that is, using (50), the correction would
be $\alpha > 1$. That is, the Gaussian approximation is too "pessimistic".

We show in Fig. 2 the values of the optimal correction factors for different values of SNR
and SIR. For high SNR, as predicted by (52), $\hat{\alpha} \approx \hat{\alpha}^{\infty} = 1 - g_n/h_n = 1 - 10^{-\mathrm{SIR}/20}$ (with SIR in dB); for example,
when SIR = 6 dB, $\hat{\alpha} \approx 0.5$. We also show the value of $\hat{\alpha}^{\mathrm{GMI}}$, cf. (12), and we see that it is
quite close to $\hat{\alpha}$. Both factors are increasing with SINR as the case $\mathrm{SINR} \to \infty$ corresponds
to the assumed absence of the interference, that is, $\hat{\alpha} \to 1$.

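It is also interesting to compare the obtained correction factors to those that might be obtained
by minimizing the actual PEP. To make it possible we again analyze the two-state mismatch from
Sec. III-A. That is, we assume that $d_1$ L-values affecting the PEP are mismatched as in our example,
while $d_2$ L-values are matched, that is, no interference is present during their transmission.

From (45) we easily deduce the distribution of the mismatched L-values $\tilde{L}$ and the matched
ones $L$ as

$$ \begin{aligned}
p_{\tilde{L}|C}(l|0) &= \frac{1}{2}\left[ \Psi(l + \mu_1; \sigma^2) + \Psi(l + \mu_2; \sigma^2) \right], \\
p_{L|C}(l|0) &= \Psi(l + \mu_0; \sigma^2),
\end{aligned} \tag{56} $$

where $\mu_1 = \frac{2 h_n (h_n - g_n)}{\sigma_z^2}$, $\mu_2 = \frac{2 h_n (h_n + g_n)}{\sigma_z^2}$, $\mu_0 = \frac{2 h_n^2}{\sigma_z^2}$, and $\sigma^2 = \frac{4 h_n^2}{\sigma_z^2}$.

Since the convolution of $d_1$ corrected mismatched L-values $\tilde{L}^{\mathrm{c}}$ with $d_2$ matched L-values $L$
is a mixture of Gaussian functions, we easily obtain the analytical expression for the PEP of the
two-state mismatch (2SM)

$$ \mathrm{PEP}^{\mathrm{2SM}}(\alpha) = \frac{1}{2^{d_1}} \sum_{k=0}^{d_1} \binom{d_1}{k}\, Q\left( \frac{\alpha\left(k\mu_1 + (d_1 - k)\mu_2\right) + d_2\mu_0}{\sigma\sqrt{d_1\alpha^2 + d_2}} \right), \tag{57} $$

and we define

$$ \hat{\alpha}^{\mathrm{2SM}} = \arg\min_{\alpha} \mathrm{PEP}^{\mathrm{2SM}}(\alpha), \tag{58} $$

which is shown in Fig. 2 for various combinations of $2 \leq d_1, d_2 \leq 8$ as, now, $\hat{\alpha}^{\mathrm{2SM}}$ depends on
$d_1$ and $d_2$. We can appreciate that the optimal values $\hat{\alpha}^{\mathrm{2SM}}$ are very close to $\hat{\alpha}$ resulting from
the saddlepoint-based criterion we proposed, which is obtained without restrictive assumptions
on the two-state structure of the mismatch.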
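
The two-state PEP can also be evaluated numerically. The Python sketch below assumes the Gaussian-mixture model just described: each mismatched L-value has negated mean $2h(h - g)/\sigma_z^2$ or $2h(h + g)/\sigma_z^2$ with equal probability, each matched one has negated mean $2h^2/\sigma_z^2$, and all have variance $4h^2/\sigma_z^2$. The function names, the grid limits, and the numerical values are assumptions, and a grid search stands in for a proper minimization.

```python
import math


def q_function(x):
    """Gaussian tail probability Q(x)."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def pep_2sm(alpha, d1, d2, h, g, sigma_z2):
    """PEP for d1 scaled mismatched and d2 matched L-values (Gaussian mixture)."""
    mu1 = 2.0 * h * (h - g) / sigma_z2
    mu2 = 2.0 * h * (h + g) / sigma_z2
    mu0 = 2.0 * h * h / sigma_z2
    sigma = 2.0 * h / math.sqrt(sigma_z2)
    scale = sigma * math.sqrt(d1 * alpha ** 2 + d2)
    total = 0.0
    # k of the d1 mismatched terms see the interference reducing the gain.
    for k in range(d1 + 1):
        mean = alpha * (k * mu1 + (d1 - k) * mu2) + d2 * mu0
        total += math.comb(d1, k) * q_function(mean / scale)
    return total / 2 ** d1


def alpha_2sm(d1, d2, h, g, sigma_z2, step=1e-3, upper=2.0):
    """Grid search for the PEP-minimizing factor (grid limits are assumptions)."""
    grid = [k * step for k in range(1, int(upper / step) + 1)]
    return min(grid, key=lambda a: pep_2sm(a, d1, d2, h, g, sigma_z2))


if __name__ == "__main__":
    h, g, sigma_z2 = 1.0, 0.5, 0.5  # hypothetical gains and noise variance
    for d1, d2 in [(2, 2), (4, 4), (8, 2)]:
        print(d1, d2, alpha_2sm(d1, d2, h, g, sigma_z2))
```

Without interference all L-values are matched and the search should return a factor close to one, which offers a quick sanity check of the implementation.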

For completeness we also show the results of the WLSF defined in (11). We can expect it to
provide reasonable results when $f^{\mathrm{c}}(l)$ is almost linear, that is, when the pdf $p_{\tilde{L}_n|C_n}(l|0)$ is close
to Gaussian. This happens when the interference is dominated by the noise, i.e., for low SNR and
high SIR and then, as we can see in Fig. 2, $\hat{\alpha}^{\mathrm{WLSF}}$ is close to the PEP-minimizing values $\hat{\alpha}^{\mathrm{2SM}}$.
On the other hand, when the SNR is high and the SIR low, the results obtained are far from the optimal
values.

Now, to take our solution out of the PEP-related considerations, and to verify how the correction
affects the performance of a practical decoder, we consider a case where a block of 1000
bits is encoded using the convolutional encoder $\{15, 17\}_8$ of rate 1/2 [ ] and the turbo code
$\{1, 15/13\}_8$ [ ] with rate 1/2 (obtained via puncturing of the parity bits).

We recall that for identically distributed L-values, the performance of the ML decoder cannot
be improved via linear correction. Thus, to show the eventual advantage of the correction, we
assume that the channel gains $h_n$ are unit-energy Rayleigh variables, so the average SNR
$\overline{\mathrm{SNR}} = 1/N_0$ is used to characterize the channel. The correction factor has to be found for each
value of $h_n$, which is assumed perfectly known at the receiver. The average signal-to-interference ratio
is set to $\overline{\mathrm{SIR}} = 6$ dB for CC and $\overline{\mathrm{SIR}} = 8$ dB for TC; for these values, the measurable BER
results can be presented in the same range of SNR. The results of decoding (Viterbi decoding
for the convolutional code and turbo decoding with five iterations for the turbo code) in terms of
bit-error rate (BER) are shown in Fig. 3. We also show the results of the decoding using true
L-values, i.e., L-values obtained via (44).

The comparison with the GMI-based correction is in order even if, as shown in Fig. 2, the
correction factors are similar to those obtained using our method. Note that, unlike in our
method, solving the GMI-based problems (12) or (14) requires numerical integration, and
the solution turns out to be sensitive to the number of points of the numerical quadrature
(we used the Gauss-Hermite method with 40-100 points). Due to these numerical issues, beyond
SNR = 15 dB and particularly for large SIR we were not able to find the solution of (12) in
the interval $\alpha \in (0, 1)$ (where it must belong). These practical aspects also speak in favour of
the correction based on the saddlepoint equation (48), where no integration was necessary and
the solution was readily obtained using (54). To go around these implementation problems, for
instantaneous SNR $|h_n|^2/N_0$ larger than 15 dB (i.e., where $\hat{\alpha}$ and $\hat{\alpha}^{\mathrm{GMI}}$ are quite similar, cf. Fig. 2)
we used $\hat{\alpha}$ instead of $\hat{\alpha}^{\mathrm{GMI}}$. We also opted for an off-line solution: we pre-calculate a table of
$\hat{\alpha}^{\mathrm{GMI}}$ for many values of SNR and SIR and, during the simulations, for each instantaneous SNR
and instantaneous SIR $|h_n|^2/g_n^2$ the value of $\hat{\alpha}^{\mathrm{GMI}}$ is interpolated from the table.

We can see that the correction results based on our method or on the GMI approach are
similar and partially bridge the gap to the results based on the true L-values. The performance
improvement is particularly notable for high average SNR, which is consistent with the results
of Fig. 2, where the most significant corrections (small values of $\hat{\alpha}$) are obtained for high SNR.

In Fig. 3 we also show the results of the correction derived assuming that the interference is
Gaussian, yielding the correction factor $\hat{\alpha}^{0} = \sigma_z^2/\sigma_{N+I,n}^2$. Since $\hat{\alpha}^{0}$ is independent of the channel gains
$h_n$, it is common to all the L-values and thus irrelevant to the performance of the ML decoder. For
this reason, the results obtained with $\hat{\alpha}^{0}$ and without correction ($\alpha = 1$) are identical for CC, where the ML (Viterbi)
decoder is used; they are thus not shown in Fig. 3. On the other hand, the turbo decoder, based on
the iterative exchange of information between the constituent decoders, depends on the accurate
representation of the a posteriori probabilities via the L-values [10]. It is, therefore, sensitive to
the scaling, and the correction with $\hat{\alpha}^{0}$ improves the results compared to those obtained without
correction.

Figure 3. Simulation results for the rate-1/2 convolutional code (CC) and rate-1/2 turbo code (TC) obtained using the L-values
without correction (45) (i.e., $\alpha = 1$), the L-values with the optimal correction we propose (calculated solving (48)), and the true L-values
(44). The results obtained from L-values corrected using $\hat{\alpha}^{\mathrm{GMI}}$ are shown with the filled markers. For CC we use ML decoding,
so the results based on the Gaussian model of the interference ($\hat{\alpha}^{0}$), (50), are identical to those obtained without the correction,
as discussed at the end of Sec. IV.

V. Conclusions 

In this work we propose a new method to find the linear correction of the mismatched
L-values. Aiming at the minimization of the probability of errors made by the maximum-likelihood
decoder, we find that the correction factor equals twice the saddlepoint of the cumulant
generating function (CGF) of the L-values. Our method is shown to bear similarities to the one
based on generalized mutual information proposed before but is simpler to implement because,
working in the domain of the CGF, numerical integration may be avoided. We illustrate our
findings with the analysis of BPSK transmission in the presence of interference, where
the correction factor clearly improves the performance compared to the mismatched metrics
without correction.